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Ontology sense-making or visual comprehension of the ontological schemata 
and structure are vital for cross-validation purposes of the ontology increment 
during the process of applied ontology construction. Also, it is important to 
query the ontology in order to verify the accuracy of the stored knowledge 
embeddings. This will boost the interactions between domain specialists and 
ontologists in applied ontology construction processes. Hence existing 


Keywords: mechanisms have numerous of deficiencies (discussed in the paper), a new 
; algorithm is proposed in this research to boost the efficiency of usage of tree- 

Algorithm maps for effective ontology sense making. Proposed algorithm and prototype 

Ontology are quantitatively and qualitatively assessed for their accuracy and efficacy. 
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1. INTRODUCTION 

In ontological taxonomy or structure assessment ’visualization compactness ‘is an insolvable 
problem, since the screen size will become a fixed barrier [1], [2]. However, visualization clarity can be 
enhanced via rational blend of appropriate visualization techniques [3]. In applied ontology construction 
processes non-computing domain specialists also contribute for the knowledge modelling aspects. As claimed 
by [4], visualisation complexity is an adversely affecting bottleneck, which hinders their (i.e. domain 
specialists) active contribution to the process of applied ontology construction. [5] claim visualisation as a dire 
necessity for ‘ontology sense making’. Hence, it is an active research niche, where researchers examine for 
better visualisation techniques to facilitate effective ‘ontology sense making’. This paper discusses the existing 
challenges associated with ontology visualisation and proposes a resolution to overcome existing challenges. 
As elaborated in the related work section beneath, among multiple visualization techniques, tree-maps are 
commended for their drill down enabled traversal facility and knowledge abstraction aspects contributing for 
the simplicity [6], [7]. Hence, the main objective of this research is to propose a fully automated tree-map 
generation algorithm, which can function regardless of the domain or the schemata to facilitate ontology sense 
making. The proposed solution is evaluated by both domain specialists and ontologists, with the intension of 
providing a refined reflection. 
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2. RELATED WORK 
2.1. Challenges 
2.1.1. Scale vs. amount of information to be displayed 

Displaying all significant information, without mentally overloading the stakeholder is still an open 
challenge [8]. As a result of increased information density, visual clutter and occlusion occur. This makes the 
ontology sense making a complex task. Refer Figure 1 for more details. 
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Figure 1. Occlusion and clutter in visual canvas 


2.1.2. Cognitive complexity 

When displaying a complex ontology schema, presentation canvas become extremely clotted and 
complex with excessive consumption of screen space. This will result in lot of scrolling vertically and 
horizontally, in order to trace the required region of the ontology [8]. 


2.1.3. 3D vs 2D. 
3D is not a good solution for ontology visualization requirements, since it causes stress escalation and 
cognitive overload to the users [8]-[10]. 


2.1.4. Learning curve & information loss 
Use of techniques such as “Eular diagraming” results in information loss and additional 
comprehension barriers. Moreover, there is an integrated learning curve associated with the “Eular concepts” 


[11]-[13]. 


2.2. Existing visualisation methods 
2.2.1. Graph-based methods 

This is the most popular technique associated with ontology visualisation. Most of the users are 
familiar with this technique. The drawback is such that, when the ontology becomes complex, clutter and 
occlusion will occur as depicted in Figure 1. Henceforth with lots of nodes and edges crossing each other, this 
will make the presentation canvas cognitively complex to comprehend. Refer Table | to recognise several of 
ontology visualisation plugins that use this method [8], [14]-[16]. 
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Table 1. Visual plugin comparison 


Plugin Category Pros Cons Reflection 
OntoViz [17], Graph-based Offer a good overview of Easily outgrow and Higher possibility of becoming 
[18] the hierarchy of canvas occlusion occur a cognitively complex visual 
connections representation for domain 
specialists. 
OntoSphere Graph-based Clear representation of a Easily outgrow and Higher possibility of becoming 
[17], [19] 2D hierarchical graph and _ canvas occlusion occurs. a cognitively complex visual 
3D sphear view. Different Extractions and representation for domain 
colour codes for different retractions are not specialists. 
nodes. Zooming available. _ possible 
TG Viz Tab Layout-based _It provides a global No relational links Higher possibility of becoming 
[20] overview and representation. Chaotic a cognitively complex visual 
recommended for quick view of the hierarchy. representation for domain 
browsing. Based on focus- specialists. 
directed mode coming as a 
specialization in the 
graph-based category. 
Extraction-retraction 
possible. 
Jambalaya Graph-based Two visualization modes Extractions and By introducing more user- 
[21] and layout- are available as the tree- retractions are not friendly interaction methods, 
based map version (layout- possible. Movements, the tree-map view of Jambalaya 
based) and focused- rotations are not can be enhanced to work as a 
directed version (graph- supported. Graph-based more comfortable platform for 
base). The tree-map view view easily outgrows domain-specialists as well. 
addresses the scale vs causing canvas occlusion. _ Because it’s the only 
information problem visualization mode, which can 
significantly. address the scale vs information 
overload problem. 
Glow [22] Euler Capable of representing Most of the users are not The learning curve is high and 
diagram- the hierarchical familiar and mathematics _ could not be suitable for 
based relationships in a circular are required to domain-specialists 
view. comprehend some aspects 
Swoop [23] Euler Capable of representing Cardinality and property The learning curve is high and 
diagram- the hierarchical information can be lost. could not be suitable for 
based relationships in a circular Easily outgrow and domain-specialists 


canvas and mathematics 
are required to 
comprehend some aspects 


view. Disjointness also 
can be presented via non- 
overlapping of circles 


2.2.2. Layout based methods 

There are multiple sub-categories of layout-based methods available for ontology visualisation. These 
categories are namely; force-directed, radial, inverted-radial, circular and tree-maps. Radial, inverted-radial, 
and circular layouts are criticised for their excessive space wastage, rotated text representation and loss of 
hierarchical structure, which cause additional cognitive overload on the user for the realizations [6], [7]. 
Similarly, force-directed method also causes information losses [6]. 

However, usage of tree-maps has been acclaimed due to its ability of effective utilisation of the 
visualisation space available, provision of traversal experience to the users via drill-down interaction and the 
effective rationale of layered information handling. Moreover, all these features contribute towards effective 
handling of “scale versus information problem” that was already discussed. Hence, the tree-map layout has 
been effectively used for the visualisation of the concepts of Gene Ontology as many users have commended, 
due to its capability of reducing the cognitive overload [6], [8], [15], [24]-[26]. 

Furthermore, tree-maps preserve the hierarchical organisation of the information schemata and make 
the comprehension easier even for the non-technical specialists facilitated via the traversal interaction 
experience [6], [8], [15], [24]-[26]. Multiple literatures [27]-[29] have depicted that, tree-maps visualisation 
techniques can be fine-tuned to address most of the aforementioned requirements, since these are identified as 
a layout-based visualisation mechanism with the screen space-filling capability. Therefore, each pixel in the 
visualisation of a tree-map contributes to the information representation [28] with another value-added feature 
of ‘content-aware resizing’ [27]. Therefore, researchers claim that, with the proper configuration of the tree- 
maps, it will be a fully dedicated visualisation artifact for the end-users, assisting effective “ontology sense- 
making” [27], [28]. Also, a variety of applications of tree-maps in domains such as stock-market analysis and 
genetics have justified their ability to provide abstract representations, overcoming the cognitive barriers 
associated with comprehensions [8]. Furthermore, researchers have justified that, it will take lesser time to 
comprehend a tree-map based conceptualisation, compared to those of other modes of visualisation since its 
layer-based dimensionality reduction is accomplished though cognitive walk-through facility [28], [30]. 
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However, one of the limitations of the tree-map is that, the information fed into the tree-map 
visualisation artifact needs to be properly grouped in a fine-grained manner enforcing the domain constrained 
mappings existing in the conceptualization. Otherwise, none of the afore-mentioned advantages of the tree- 
maps can be leveraged [6], [8], [15], [24]-[26]. Refer Table 1 for a list of layouts method-based plugins. 


2.2.3. Eular diagraming method 

In relation to the Euler diagram-based methods, researches have claimed that, it is a novel 
representation mechanism, where the majority of the user-bases might not be familiar, in contrast with other 
visualisation categories. Furthermore, it has been pointed out that, there is a tendency of information loss in 
representing data associated with certain domains (i.e., property associated information). As a remedy of 
overcoming this information loss, in addition to the Eular diagrams interpretations, mathematical 
representations are also needed. Additionally, it is being suggested, to boost the understanding of the 
representations, piercing theory will assist end-users, even more in accurate comprehension of axioms 
presented in Euler diagrams. Therefore, it is identified as technically complex and the learning curve to be 
comparatively high in learning Euler diagrams and piercing theories. Therefore, proper comprehension of the 
knowledge representations, is definitely going to be an additional overload to the end-user [11]-[13]. Refer 
Table | for a list of Eular-method based plugins. 


2.3. Reflection 

According to the afore-mentioned discussion, it is apparent that there are challenges to be resolved in 
order to escalate the visual sense making clarity of the ontologies. Among the methods reviewed, other than 
the tree-maps, all other methods have several critical deficiencies. Even in tree-maps there is a necessity to 
properly group and feed the information. Or else provision of the afore-mentioned advantages of the tree-maps 
will not be feasible. Therefore, it can be concluded that, tree-maps are a potential solution to represent 
ontological schemata in a comprehensible manner. However, there needs to be a specifically defined algorithm 
to organise the schematic mappings of the ontology without any information loss, before feeding to the tree- 
map chart framework. Proposed algorithm will make tree-map generation for the ontological schemata fully 
automated and manual configurations free. The remaining section of the paper discusses, how the afore- 
mentioned objective is accomplished. 


3. RESEARCH METHOD 

Design science research methodology [32] is customised and used for this research. In literature it is 
stated that, design science research methodology is ideal for investigating human centered interventions [31 ]- 
[33]. Ontological sense making via visualisation is also a human-centered activity initiated by both domain 
specialists and ontologists. Hence, it is concluded that, design science research methodology will be a suitable 
choice for this research. Customised version of the design science research methodology, has been utilized in 
this research as elaborated in Figure 2. 
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Figure 2. Customised version of the design science research methodology 


As already depicted in Figure 2, the first step of the design science research methodology is to literary 
justify the problem of concern. This step has been already accomplished as per the contents discussed in the 
related work section. Second step is to derive on a potential solution. Again, as already conversed in the 
literature review, efficacy of the tree-maps is literary justified against the other alternatives. Hence, it's decided 
to use tree-maps layout approach as the potential solution proposed for this research as well. In the third step, 
depending on the outcomes of the step one and two, an algorithm is designed to feed information to the tree- 
maps. Subsequently the designed algorithm is developed using java to exercise it on the real-world use cases. 
The proposed algorithm is applied on three different domains to verify it's domain and schema independent 
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capability. The steps followed in algorithm evaluation is methodically discussed in the evaluation section. 
Ultimately, iterative framework [34] is utilized to derive a final verdict about the algorithm's capability to 
function in a domain and schema independent manner. Evaluation procedure utilized is elaborated in detailed 
in the evaluation section of this paper. 


4. RESULTS AND DISCUSSION 

High level communication flow of the proposed solution is represented in Figure 3. Flow of the 
algorithmic execution is illustrated in Figure 4. The first step of the algorithm is to extract classes, data 
properties, object properties, relationships with mappings preserved. This will assure, no mapping associated 
information in the ontology will be lost. Henceforth, all the extracted information sets are stored in a database 
as per the mappings residing in the ontology. 

Once this step is completed, tuples can be extracted one-by-one according to the process illustrated in 
Figure 5. This process is iterative until all tuples are processed. Tuple associated information are passed to 
‘infocollectorTreemapRpt’ method with a flag label claiming the type of the information sent to the method. 
This flag label acts to control conditional execution of the relevant code snippets associated with the appropriate 
conceptual aspects of the ontology (i.e., disjoint relationship), provided from the database tuple. 


<n o | 


Figure 3. High-level communication flow 


Figure 4. Algorithmic execution flow 
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Figure 5. Tuple extraction logic 


Inside conditional code snippets, first statement is for the definition of the header layout of the tree- 
map. This will define one layer in the tree-map for the concept of concern (i.e. disjoint relationship). 
Additionally, level associated information is also needed to be stored in order to ensure smooth back-and-forth 
traversal, facilitating the cognitive walkthrough. This entire process is elaborated in Figure 6. 

Once all information of the ontology file is packaged and organised as in Figure 6, that information 
will be passed into the ’generateReport ‘method. This method is responsible for the creation of the HTML 
version of the organised contents. This entire process is illustrated in Figure 7 code snippet. After examining 
on multiple charts frameworks, finally, Google charts framework is selected for the purpose of this research 
(i.e. charts.js framework doesn't have tree-maps and D3 charts framework’s syntax and configurations are very 
complex compared to the Google charts framework). Google chart’s java script logic is separately maintained 
as an independent html file. Database contents are iteratively appended to another html file (i.e. Figure 7). 
Eventually, both these html files are merged together to derive the final functioning tree-map layout as depicted 
in Figure 8. 
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Figure 6. Tree-map layer organization logic 


jeE is ara ota angus eRayay i Scrinwnt er ah, Goes Sag) thee eee 
1 


erties frewanring 275 


‘ inject Google chores jevorcpt 
ie jomecy Pw 


tle Gorltarpi tee mee Pole" Lary ie | 
“TwPeTTinsgy LDS TDS. eee firm! para Dee i pes 


Stet cing ies: 


22 }Elepe=" inher rhacece™| qwoile deteboe cones be the hon! ile 
j 
Dobfecedaricer wlite= = now DubfesecdNsitec ices Foleteitec (ET 


cin 


wolbereppaind Ihoalocring); Append Gon) orrclogy cones and Goole charts jis sorpe bo 
peer te Lie final hee map 


Figure 7. HTML version of the tree-map file 
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Figure 8. Final functioning tree-map layout; (a) Functioning tree-map-at high level, (b) Functioning tree- 
map-drill down enabled 


5. EVALUATION 

For the evaluation process six domain experts and three ontologists were independently selected. 
These domain specialists are from psychotherapy domain (two consultant psychiatrists), labor law domain 
(two lawyers) and marine biology domain (two biologists) respectively. All these six members were involved 
in ongoing ontology creation projects as domain specialists. Hence, they are equipped with an extensive 
knowledge about the taxonomy and schematic organisation of those respective three ontology increments, 
belonging into afore-mentioned three domains. As the first step, all of them were educated about this prototype 
developed and its applications via a workshop. Henceforth, they were requested to use this prototype in their 
latest ontology increments and comment on the opinions about the output as correct or incorrect. 

All correct instances are classified as true positives and wrong instances (i.e. situations where 
taxonomic structure was not depicted accurately) as false positives. Henceforth, those values are used to 
calculate precision, recall and F-measure accuracy matrices. Domain specialists were informed to decide their 
opinions and log them as true positives or false positives. They were given a time duration of one week to 
complete this activity. By the end of the week, they were provided with a questionnaire and a specialised grid 
to rate their genuine experience with the prototype. The grid structure provided for capturing the ratings are 
depicted in Figure 9. 
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Figure 9. Rating grid 


Questionnaire provided to the domain specialists contained questions on functionality of the 
prototype, accuracy of the prototype and the effectiveness of the proposed concept. The quantitative opinions 
provided by them and the accuracy matrices calculated are tabulated in Table 2 and Table 3. 
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Table 2. Quantitative matrices on domain specialists’ opinion 
Domain Specialist-Domain of interest Functionality Accuracy Effectiveness 


A - Psychotherapy 80% 80% 70% 
B- Psychotherapy 70% 80% 80% 
D- Law 70% 80% 80% 
E- Law 80% 90% 70% 
F- Marine Biology 90% 80% 70% 
G-Marine Biology 80% 80% 80% 
AVG 18% 82% 715% 


Table 3. Averaged Accuracy matrices 
Accuracy Matrices — Consolidated Results 


Precision Recall F-Measure 
0.95 0.92 0.93 


Without discontinuing from the quantitative assessment, all the domain specialists were compelled to 
take part in a controlled interview session. The output of the controlled interview sessions conducted was 
summarised and tabulated in Table 4 (salient points extracted via doing a thematic assessment). 

In the second phase of the experiment, three ontologists were invited. They were briefed regarding 
the technical inner workings of the prototype while the results derived from the domain specialists’ experiments 
were also exposed to them. Henceforth, they were also given the same grid to rate their opinions about the 
proposed prototype. However, their questionnaire contained questions on effectiveness of the prototype, 
novelty of the prototype and the architecture of the prototype. After collecting their quantitative assessment, 
they were also compelled to take part in a controlled interview session for deriving their qualitative reflections. 
The information collected are tabulated in Table 5 and Table 6. 


Table 4. Qualitative reflections of domain specialists 
Parameter Viewpoints 
Functionality — overall 78% 1 Useful for knowledge blending 
2. Drill down facilitates understanding 
3 Can traverse across the ontology 
1. Dimensionality Reduction and easy comprehension 
2. Simplicity and layered abstraction 
3 
1 
2 
3 


Accuracy - overall 83% 


Quick & accurate 

Good to develop collaboration aspects as needed in ontology construction. 
No need of technical configurations 

Domain and schema independent. 


Effectiveness — overall 77% 


Table 5. Quantitative matrices on ontologists opinion 
Ontologist Effectiveness Novelty Architecture 


A 80% 90% 90% 
B 70% 90% 80% 
Cc 80% 80% 80% 
AVG 717% 87% 83% 


Table 6. Qualitative reflections of ontologists 
Parameter Viewpoints 
Effectiveness-overall 77% . Layered abstraction of information 
. Good solution for scale vs information density 
. Less susceptible for occlusion 
. Drill down enable cognitive walk-through 
schema independent 
. Domain independent 
. No datasets needed for model training 
. Works with any domain 
. Cognitive walkthrough 
Light weight operation 
. Algorithm links the Google charts framework with the ontology 
. No need for configurations 
. Schema agnostic operation 
. Domain agnostic operation 


Novelty-overall 87% 


Architecture-overall 83% 
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Eventually iterative framework was utilised to assess the objective accomplishment of the entire 
research (independent ontologists was invited for this task). Iterative framework is an established framework 
for opinion mining [34]. In this research, it is used to assess the accomplishment of the research objective of 
this research. These details are tabulated in Table 7. Entire evaluation process is depicted in Figure 10. 


Table 7. Iterative framework’s results 
Iterative Framework Step Justification Elaborations 
01 What are the data telling me? —_In quantitative experiment conducted, domain specialists have given a 78% of averaged consent 
for the proposed resolution. 


Ontologists have looked at the prototype in a more technical perspective and they also have given 
an overall consent of 82% for the proposed resolution. 


Additionally, the qualitative feedback provided (i.e. summary is documented in tables 3 and 5) by 
both domain specialist and ontologists also depicts positive attributes about the entire framework, 
and it’s workarounds. 


02 What do I want to know? How effective is the proposed visualization resolution in accomplishing the stipulated research 
objective? 
03 Is there a dialectical Yes. 
relationship in step 01 and Both qualitative and quantitative results yielded has depicted the efficacy of the proposed 
step 02? resolution, in terms of the research objective to be addressed. 


Therefore, it can be concluded the proposed solution is satisfactory at its current state and research 
objective is accomplished 


Figure 10. Entire evaluation flow 


6. CONCLUSION 

It was already conversed that there is a requirement for a sensible solution for effective ontology sense 
making. Existing methods had numerous challenges as already discussed in the section for related work. 
Among all existing approaches, tree-maps seem to provide a sensible resolution to this problem. However, in 
order to gain the real advantage from the tree-map, information extracted from the ontology needs to be fed to 
the tree-map, preserving the taxonomic mappings and schematic information in the ontology. The objective of 
this research is to propose an algorithm to accomplish that shortcoming. Proposed algorithm is capable of 
extracting all required information from the ontology file and methodically feeding it to the tree-map without 
loss of information (refer Tables 2-6 as justifications). Therefore, this has resulted in effective tree-map 
operation for ontology sense making, as already justified through the evaluation outcomes as well (refer 
Tables 2-6). As for future recommendations, influence will be given to further enhancement of the human 
computer interaction aspect of the proposed tool. 
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